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ABSTRACT 


The  statistical  distribution  of  biological  phenomena 
is  generally  assumed  to  be  normal  or  Gaussian.  In  some 
instances,  however,  the  distribution  is  lognormal,  that 
is,  the  logarithm  of  the  variable  is  normally  distributed. 
This  report  presents  a technique  for  determining  the  mean 
and  standard  deviation  of  these  distributions  from  the 
cumulative  distribution  function.  The  actual  distribution 
is  compared  against  a standardized  cumulative  distribution 
function  of  mean,  5,  and  standard  deviation,  1,  (probit 
transformation).  The  relationship  between  the  random  vari- 
able and  the  probit  is  found  by  linear  regression  and  the 
corresponding  mean  and  standard  deviation  determined.  Two 
programs  are  presented  for  solving  this  problem,  one  writ- 
ten for  a Hewlett-Packard  9820A  programmable  calculator 
with  plotter,  and  the  other  written  for  a digital  computer 
in  Fortran  IV.  As  an  example  of  the  use  of  these  programs, 
the  platelet  size  distribution  in  a fresh  blood  sample, 
obtained  from  rats  subjected  to  hyperbaric  exposures  and 
subsequently  decompressed,  is  solved. 
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INTRODUCTION 


The  statistical  distribution  of  biological  phenomena  is  generally 
assumed  to  be  a normal  or  Gaussian  distribution.  In  certain  instances, 
howe.er,  the  distribution  can  be  asymmetric  with  the  logarithm  of  the 
variable  obeying  the  normal  law  of  probability.  This  distribution  is 
commonly  described  as  a lognormal  distribution.  An  example  is  the 
platelet  volume  distribution1’  in  a fresh  blood  sample.  Many  other 
examples  exist  in  a variety  of  fields,  most  notably  in  economic  data 
where  the  lognormal  distribu  .on  is  the  rule  rather  than  the  excep- 
tion3 . 


The  basic  problem  when  one  has  a normal  or  lognormal  distribution 
is  to  obtain  the  parameters  for  the  distribution,  that  is,  the  mean 
and  the  standard  deviation.  If  the  information  is  presented  as  the 
non-cumulative  probability,  and  a great  number  of  observations  are 
available,  it  is  possible  to  fit  the  distribution  by  standard  techni- 
ques such  as  the  least  squares  method  for  finding  the  mean  and  the 
standard  deviation.  However,  if  the  information  is  presented  as  the 
cumulative  probability,  that  is,  the  probability  that  the  observation 
is  less  than  or  equal  to  some  value,  the  problem  becomes  more  diffi- 
cult to  solve.  The  cumulative  distribution  function  is  a sigmoid 
curve.  If  a sufficient  number  of  observations  is  used  to  specify 
this  curve,  it  is  possible  to  extract  a noncumulative  distribution 
and  use  standard  techniques.  If  the  number  of  observations  is  not 
large,  then  one  must  resort  to  other  methods. 

In  this  paper,  a technique  for  determining  the  mean  and  standard 
deviation  from  a cumulative  distribution  function  is  presented  where 
the  actual  distribution  is  compared  to  the  standardized  cumulative 
distrubution  function.  This  technique  can  be  easily  programmed  on 
programmable  calculators  or  on  a digital  computer,  and  a calculator 
program  and  a Fortran  IV  listing  are  presented.  The  same  principle 
has  been  used  previously  by  Servantie  et  al7  but  the  version  presen- 
ted here  is  designed  for  much  faster  operation  with  less  calculations. 

METHOD  OF  CALCULATION 

The  general  equation  for  the  noncumulative  probability  density 
function  (p.d.f.)  is  given  by 
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where  x is  the  measured  variable 
u is  the  mean,  and 
a is  the  standard  deviation. 

The  cumulative  distribution  function  (c.d.f.)  is  given  by 


F (x) 


(2) 


where  F is  the  probability  that  the  random  variable  is  less  than  or 
equal  to  x.  The  standardized  p.d.f.  is  defined  for  p = 0 and  o = 1 
and  tables  giving  both  the  p.d.f.  and  c.d.f.  are  readily  available. 

In  the  case  of  the  lognormal  distribution,  the  logarithm  of  the 
variable  x is  normally  distributed.  The  p.d.f.  is  given  by 
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(x>0  only) 
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and  the  c.d.f.  by 
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(4) 


The  simplest  method  to  determine  the  mean  and  the  standard  devia- 
tion from  the  c.d.f.  is  to  use  normal  or  lognormal  probability  paper, 
depending  on  the  distribution.  The  c.d.f.  will  plot  as  a straight 
line  and  it  is  a simple  matter  to  determine  the  mean  at  the  50%  proba- 
bility point  and  the  standard  deviation  from  the  84.1%  probability 
point.  If  better  accuracy  is  required  or  if  the  observations  have 
sufficient  scatter  that  a linear  regression  must  be  carried  out  on  the 
data,  the  graphical  analysis  is  inadequate. 

The  method  presented  here  compares  the  observed  c.d.f.  against 
the  standardized  c.d.f.  The  probit  transformation 


P = x + 5 


(5) 


is  used  to  avoid  negative  values  so  that  the  standardized  c.d.f. 
becomes 
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with  the  mean  at  5.  For  every  x at  which  a probability  F is  observed, 
there  is  a corresponding  value,  P,  from  the  standardized  c.d.f. 
(equation  6)  for  the  same  value  of  F.  If  the  observed  distribution 
is  normal,  the  relationship  between  x and  P will  be  linear. 

P = ax  + b (7) 

and  if  the  distribution  is  lognormal 

P = a lnx  + b.  (8) 

The  slope,  a,  and  the  intercept,  b,  can  be  obtained  by  linear 
regression  from 
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where  n is  the  total  number  of  observations  and  X is  either  x or  lnx 
depending  on  the  distribution. 

The  mean,  m,  can  be  found  from  P = 5,  hence 

m = (5  - b)/a  (1] 

and  the  standard  deviation,  s,  from 

s = 1/a.  (i; 


For  the  lognormal  distr ibutionb , it  is  useful  to  have  also  the  mode 
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the  median 


median 


m 


e 


(14) 


and  the  expectation  value 
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Although  the  method  is  straightforward,  the  problem  lies  in 
determining  P for  a given  value  of  F (from  equation  6) . Since  the 
probability  integral  can  only  be  solved  numerically,  i.e.  F as  a 
function  of  P,  the  inverse  problem,  P as  a function  of  F,  is  diffi 
cult  to  solve.  Servantie  et  al7  use  an  iterative  process  with 


Ax 


by  subdividing  Ax  until  the  calculated  F is  sufficiently  close  to  the 
observed  F and  then  determining  the  value  of  x.  The  process  is,  how- 
ever, extremely  slow  for  use  on  programmable  calculators. 

The  technique  shown  here  is  based  on  a rational  approximation  to 
the  inverse  probability  integral.  If  the  probability  integral  is 
written  as 


with  0<Y<0.5,  then 
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where  | 

t (Y ) 1 < 4.5  x 

10~4 

and 

t 

= y ln(l/Y2) 

(18) 

The 

set 

of  constants, 

C and 

d,  are  given  by 

c0 

= 2.515517 

d,  = 

1.432788 

Cl 

= 0.802853 

d2  = 

0.189269 

C2 

= 0.010328 

d3  = 

0.001308 

The 

probit,  P,  can  then  be 

determined 

from  equations  16  and  17 

for  the 

following  ranges  of  the  observed  values,  F, 

0<F£0.5,  Y = F, 

0. 5<F<1.0,  Y = 1 - F, 

EXAMPLE 

The  technique  presented  here  has  been  used  to  find  the  mean  and 
standard  deviation  of  platelet  sizes  in  blood  samples  obtained  from 
rats  which  were  subjected  to  hyperbaric  exposures  and  subsequently 
decompressed4.  The  data  have  been  obtained  on  a Coulter  Counter  and 
Channelizer.  Table  I shows  the  data  obtained  from  six  samples  of 
blood . 

Programmable  calculator  solution  - Appendix  I contains  a listing  of  a 
program  written  for  a Hewlett-Packard  9820A  Calculator  with  plotting 
facilities.  Appendix  II  gives  instructions  on  its  use,  and  a flow 
chart  is  shown  in  Figure  1.  Data  points  (up  to  30  observations)  are 
input  from  the  keyboard,  the  probabilities  and  probits  calculated,  and 
stored  in  memory.  Provisions  are  made  for  correcting  errors  after  all 
the  data  have  been  entered.  Either  a normal  analysis  or  lognormal 
analysis  can  be  selected  from  the  keyboard.  A plot  of  the  data  and 
calculated  fit  to  the  data  is  optional. 

Appendix  III  shows  a listing  of  the  input  data  and  the  calculated 
results  fcr  a lognormal  analysis.  The  data  are  plotted  in  Figure  2 
and  it  can  be  observed  that  the  probit  values  (indicated  by  "+")  when 
plotted  against  In  x are  linear  except  for  the  last  four  observations. 
This  departure  from  linearity  results  since  the  Coulter  can  not  dif- 
ferentiate between  large  platelets  and  small  red  blood  cells  which  are 
present  in  this  range.  The  program  has  provisions  for  omitting  these 
observations  from  the  analysis.  The  results  in  Appendix  III  are  based 
on  the  first  15  observations  only.  The  solid  lines  in  Figure  2 show 
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the  values  calculated  for  the  probit  and  the  c.d.f.  using  the  computed 
parameters  a,  b,  m and  s.  The  c.d.f.  is  calculated  by  an  iterative 
method.  The  results  indicate  that  the  distribution  of  platelet  sizes 
is  indeed  lognormal. 

The  program  has  provisions  for  repeating  the  analysis  with  a dif- 
ferent number  of  omitted  observations,  if  for  example,  a better  fit  is 
required  for  the  experimental  data.  If  a repeat  is  not  required,  then 
the  program  branches  back  to  the  location  for  selecting  either  the 
normal  or  lognormal  analysis  again,  or  for  terminating  the  program. 
Appendix  III  also  shows  results  for  a normal  distribution  analysis. 
Figure  3 shows  that  the  platelet  sizes  are  not  normally  distributed. 

Digital  computer  solution  (FORTRAN  IV)  - Appendix  IV  shows  a Fortran 
IV  listing  for  use  on  a digital  computer.  The  data  is  prepared  as  an 
input  file  on  a peripheral  device  in  the  format  shown  in  Appendix  V. 

Up  to  100  observations  can  be  used,  although  this  can  be  easily  changed 
by  modifying  the  program.  The  program  is  an  interactive  one  where  the 
operator  can  choose  a particular  input  data  file  and  normal  or  log- 
normal analysis  from  the  teletype.  If  a line  printer  is  used  for  out- 
put, then  this  must  be  signified. 

Appendix  VI  shows  the  results  (teletype  output)  for  the  lognormal 
analysis  of  the  platelet  data.  Initially,  the  parameters  are  calcula- 
ted using  all  the  observations  and  printed  out  on  the  output  device. 

The  probit,  calculated  from  the  computed  values  of  the  slope,  a,  and 
intercept,  b,  is  printed  out  as  the  difference  between  the  calculated 
and  observed  values,  since  a plotter  is  not  used.  These  differences 
and  the  correlation  coefficient  can  be  used  to  tell  whether  the  analy- 
sis selected  is  appropriate  (i.e.,  normal  or  lognormal).  If  there  is 
some  doubt,  then  the  analysis  can  be  repeated  with  the  other  type  of 
distribution  and  compared.  The  correct  analysis  will  give  a larger 
correlation  coefficient. 

The  differences  between  the  calculated  and  the  observed  values 
at  the  tails  of  the  distribution  can  be  examined  to  determine  whether 
any  observation  should  be  discarded.  The  program  asks  for  the  number 
of  observations  to  be  omitted  at  the  beginning  and  end  of  the  data 
file.  The  program  then  calculates  new  parameters  and  the  probit  dif- 
ferences. The  analysis  can  be  repeated  with  different  numbers  of  ob- 
servations to  be  discarded,  if  desired. 

When  the  analysis  is  to  be  terminated,  the  observed  data,  the 
c.d.f.,  the  probit  values,  and  the  differences  between  the  observed 
c.d.f.  and  those  calculated  from  the  mean  and  standard  deviation  are 
Pr^n^ec^  out.  The  c.d.f.  calculated  from  the  mean  and  standard  devia- 
tion uses  the  rational  approximation2 


Y(x) 


1 - Z(x)(ait  + a2t2  + a3t'1)  + c(x) 


where  0<x<“ 

t = l/(l+px) 

| e(x) |<1  X 10"5 


Z(x) 


Y(x) 


and  p = 0.33267 
ax  = 0.4361836 
a2  = -0.1201676 
a 3 = 0.937298 

The  agreement  between  the  calculated  c.d.f.  and  the  observed 
c.d.f.  is  excellent.  After  the  printout  is  completed,  the  program 
will  request  a new  input  file. 

Appendix  VII  shows  the  same  data  treated  as  a normal  distribution 
(line  printer  output).  With  the  line  printer  output,  the  correlation 
coefficient  and  probit  differences  are  also  printed  on  the  teletype. 
The  correlation  coefficient  is  less  than  that  for  the  lognormal  dis- 
tribution indicating  that  the  normal  distribution  is  a poor  fit  to  the 
observed  distribution.  This  is  borne  out  by  the  large  errors  between 
the  calculated  and  observed  values  of  the  probit  and  the  poor  agree- 
ment between  the  observed  and  calculated  probabilities. 


DISCUSSION  AND  SUMMARY 

Of  the  two  methods  presented  here,  the  programmable  calculator 
method  is  probably  the  most  useful  for  most  applications  since  a 
graphical  plot  is  available  and  it  is  easy  to  see  which  data  points 
are  spurious  and  how  well  the  calculated  values  agree  with  the  experi- 
mental observations.  It  is  limited  in  the  amount  of  data  that  can  be 
input  because  of  the  size  of  the  memory  available  in  the  calculator 


H 


for  data  storage.  The  program  can  be  easily  rewritten  to  calculate  j 

the  various  sums  continuously  so  that  the  input  data  need  not  be 
stored.  Hence  an  unlimited  number  of  observations  can  be  used.  How- 
ever, it  would  not  be  convenient  to  omit  data  points  which  may  be 
spurious  as  in  the  example  where  the  red  blood  cells  inflate  the  ob- 
served platelet  count.  With  the  present  program,  also,  the  data  can 
be  saved  on  a magnetic  card  for  future  use  if  necessary. 

The  digital  computer  program  is  better  suited  for  analyzing 
samples  where  a large  number  of  observations  are  available  since  the 
calculations  can  be  done  much  faster.  If  a large  number  of  samples 
are  to  be  analyzed,  all  the  samples  can  be  prepared  beforehand  as 
separate  input  files  and  edited  for  mistakes  before  running  them 
through  the  program.  The  data  will  also  be  available  for  reanalysis 
at  a later  time  if  necessary  without  having  to  key  in  all  the  data 
again.  A disadvantage  of  the  program  is  the  lack  of  a graphical  in- 
dication of  the  observed  values  and  the  calculated  values.  The  pro- 
gram can  be  easily  modified  if  a digital  XY  plotter  is  available. 

Both  programs  described  here  provide  a much  more  convenient 
method  for  finding  the  mean  and  standard  deviation  from  a cumulative 
probability  distribution  than  other  standard  techniques  such  as  gra-  / 

phical  methods.  The  use  of  rational  approximations  for  calculating 
the  probability  integral  and  its  inverse  provide  sufficient  accuracy 
for  the  type  of  data  generally  available,  and  also  allows  the  calcu- 
lations to  proceed  much  faster  than  with  iterative  numerical  methods. 

Although  the  example  shown  here  is  for  the  analysis  of  platelet  sizes 
in  a fresh  blood  sample,  the  technique  can  be  applied  to  any  sort  of 
data  which  may  be  normally  or  lognormally  distributed. 
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Figure  1:  Flow  chart  for  calculator  program 
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TABLE  I - PLATELET  SPECTRA  FROM  RAT 


SUBJECTED  TO  DECOMPRESSION  AFTER 
HYPERBARIC  EXPOSURE 

SIZE 

ACCUMULATIVE  COUNT 

16 

8545 

21 

24432 

26 

49667 

31 

81915 

36 

116328 

41 

149879 

46 

180259 

51 

206747 

56 

229375 

61 

248784 

66 

264744 

71 

278218 

76 

289956 

81 

300032 

86 

309024 

91 

316741 

96 

323697 

101 

329812 

106 

335196 

TOTAL  COUNT 

339075 

APPENDIX  I Program  listing  for  HP  9820A  calculator 
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H 0 . 0 B S . U S E B ' » R 

UM.  TOTAL"  »C5 

i 0 « 

3 5 S F'  C 8 3 I F F L L>  c 

F X D i F 

CFG  1 3 5 1 -> B 5 S P U 3 

3 G TO  + 3 F 

6 ■ 

5 0 R 4 R 5 R 6 h R ? * R 

.j  b ■ 

F'  R i X ? C 5 F a B 4 ? 

8 -i  R 9 -*  R 1 OF 

GSB  ' SCL  2 ''  F 

PRT  C R 2 0 - R ( 50  + B 

2 O : 

34: 

j -mjF 

R ( 20  + B ) *X  5 LN  X-iZ 

USB  “ B " F 

“7  m 

5 R ( 8 O B I •h  Y F 

~i  cr  > 

.“l  . K 

C •>  X 5 I F C > . 5 5 1 - C * 

21  : 

ENT  "REPEAT  Y " » P 

v •<  i p y,  '■ " o a b ] u i 3 P 

R 4 + X 4 R 4 3 k 5 + X X * R 'o 

? .1  F F L La  i b ? La  1 U 1 

5 R 8 + Z R 8 3 R 9 + Z Z •*  R 

5F 

f'LFi  i.  1 /XX  3 -AF 

9 F 

3 6 • 

9 : 

d.  d ■ 

G T O ! 8 F 

i"i  . F 0 + H iRi  Hi  r 2 .1  .1 

R i !■ i ■ . ■ P * L 'i  ♦ R • 

" * 

. i + H ( R 1 1 + fl  i R 1 2 •+ 

3 R 1 0 + 2 V ->  R 1 0 F 

" fc  ? .'F  G 1 3 j G S B 

H! ' 1 3 i J ' ->A  ‘ 1 F C'i  . 

V * j 

SCL  1 “ F 

G 5 -fl  * RF 

B+l+BSIF  BLR?? 

3 ; 

1 0 ; 

GTO  ~3F 

fi , . L 1 0 ? O ' 5 • . 1 3 . i 

■ ; )-H 

O i • 

d 4 » 

i f : : i if 

; ' H * b F C P 

1 ! ' i 5 1 4 - L 5 R 5 ~ . 

O Q 1 

.j  w rPT  ••hop.! in 

■ : ■ i 

■ i Y | 


B'"  AVAILABLE  COPY 


48 : 

•j  4 : 

L ! H 1 ? s . ■ J j v.:!  - 2 > 

" T'  ■■  J 

i.  , 

R 2 ! 4 X ? 5 4 A i K i. 

P L 1 ' P R 0 B H B i L I T V 

20  + F 

1 9 j 4 Y ? I F C = 2 

! 2 o 4 B ? F X D 0 P 

? LN 

r 2 1 4 x ; . 24 a; 

4i ; 

LN  P 

(20  + R19) 4 Y P 

LTR  y-2,.02,221? 

55 : 

F'LT  B510+B+B5IF 

F'LT 

X, R14K  + R15  5 X 

B L 1 0 0 5 G T O +0 P 

+ H4i- 

5 IF  X L Y 5 G T 0 

42: 

+ 0P 

L 1 R 6 0 , , 0 5 , J 3 1 5 

56 : 

F'LT  ” X “5  1 4 B P 

PEN 

5 GSB  "SCL  1 “ 

4 3 : 

P 

LTR  R ( 2 0 + B J - . 6 4 , 

cr  • ■ 

•J  j « 

P r 5 0 + B j - . 0 y 6 4 , 2 2 

- 5 0 4 

X 5 04R 1 8 5 IF  C 

1 ? F'LT  “X" 5B+1+B5 

= £ Jl  „ 

1 4 X P 

I h B i R 1 9 5 G T U + 0 P 

5 ft  n 

4 4 : 

14B? 

X 4 A ? IF  C = 2 5 

GSB  "SCL  2 " h 

LN  X 

4 H ? X 4 B P 

45: 

59 ; 

IF  C=25RXE  5 j 10j 

R18  + EXP  ( - . 5 ( I A - 

. 1 j i 5 . 0 i 9 2 4 R 5 1*B 

R 1 6 3 

/ R 1 7 ) 1 2 3 / B R 1 

? Ij  i U + 2 P 

7 r ( 2 

if  3 ->  R i S 5 IF  XL 

4 6 ; 

1 0 ; x 

4 1 4 X ? Ij  1 U “ 1 P 

fi  X E 11  a j 1 O j 5 j 1 5 . 

6 0 : 

6 4 4 H ? i ->  B P 

F'LT 

X , R 1 8 ? X + 1 4 x ? 

A * 7 - 
*+  1 '• 

IF  X 

2 R l 2 0 4 R i 9 i ? 

LTR  Z-. 03 C Z~X  3 j B 

GTG 

-2P 

,221?  F'LT  BS1+B  + B 

6 1 : 

5 I F B i 9 ? L T O + 0 P 

PEN 

P 

48 : 

62 : 

LTR  2-. 05 (Z-X) t 3 

RET 

P 

.5, 232?  F'LT  "F'ROB 

fci  O ■ 

IT” ? 2 . 5 4 B P 

"SCL 

i " ; scl  10,1 

49: 

10,0 

, ip 

IF  0 = 2?  F X D 15 

64 : 

L.  T R B - . 8 7 ? 9 . 7 , 2 2 

RET 

p 

1 5 P L T B 5 .S  + B-f-B? 

65: 

IF  B L 4 . 5 5 G T 0 +0P 

"SCL 

2"? IF  0=2? 

5 0 : 

S C L 

2 4 X , 5 4 z , 0 , 1 0 

IF  C=2? LTR  3.4,9 

? G T 0 

4 2 p 

.4?  F'LT  " LN  X " ? 1 4 

6 6 " 

BP 

SCL 

1 0 4 x ■<  1 104  2,0 

51  : 

, 1 0 P 

F:  < 2 0 + B ) 4 x ? I F C = 2 

6 7 : 

? L N R i.  2 0 + B i 4 X P 

RET 

P 

52: 

6 8 : 

LTR  X fl » R i.  o O + B .•  - 

END 

L 

r 

. 0t<4 , 22 1 ? Pi- T " + " 

R 1 48 

? B + 1.  4 B 5 IF  B 2 R 1 9 ? 

i 


APFLNDIX  II  instruction^  tor  use  ot  calculator  program 


STEP 

DISPLAY 

INSTRUCTIONS 

1 

ERASE,  LOAD,  EXECUTE,  Load  Side  1 

EXECUTE,  Side  2,  EXECUTE,  Side  3 

EXECUTE,  Side  4 

2 

END,  RUN  PROGRAM 

3 

TOTAL  COUNT 

Enter  Total  Count,  RUN  PROGRAM 

4 

X 

Enter  x - value,  RUN  PROGRAM 

5 

CUM.  TOTAL 

Enter  y-value,  RUN  PROGRAM 

6 

Repeat  steps  4 and  5 for  all  data  points 

7 

X 

To  terminate  input,  RUN  PROGRAM 

8 

CORRECT  ERROR? 

To  correct  error,  1,  RUN  PROGRAM 

If  no  errors,  RUN  PROGRAM  (Go  to  step  13) 

9 

DATA  No. 

Enter  data  number  of  incorrect  value, 

RUN  PROGRAM 

10 

X 

Correct  value  of  X,  RUN  PROGRAM 

11 

CUM.  TOTAL 

Correct  value  of  y,  RUN  PROGRAM 

12 

CORRECT  ERROR? 

If  further  errors,  go  to  step  8 

If  no  more  errors,  RUN  PROGRAM 

13 

NOR  = 1,  LOG  = 2 

For  normal  analysis,  1,  RUN  PROGRAM 

For  lognormal  analysis,  2,  RUN  PROGRAM 

14 

PLOT? 

If  plotter  used,  1,  RUN  PROGRAM 

If  plotter  not  used,  RUN  PROGRAM 

15 

OMIT?  OBS.  AT  END 

Enter  number  of  observations  to  be 
omitted,  RUN  PROGRAM 

If  no  omission,  RUN  PROGRAM 

16 

REPEAT? 

To  repeat  analysis  with  different  number 
of  omitted  observations,  1,  RUN  PROGRAM 
and  go  to  step  15 

To  terminate  analysis  or  to  try  other 
distribution,  RUN  PROGRAM 

17 

NOR  = 1,  LOG  = 2 

To  repeat  on  same  data,  go  back  to  step 

13 

To  terminate  program,  RUN  PROGRAM 

i 


r 


APPENDIX  III  Input  data  and  results  for  calculator  program 


BBLAVMMLCOPI 


9 

101.0 

DflTfl  HO, 

5 6 , 0 

3? 93 1 7 . 0 

X - V fi  L U £ 

229  3 7 5 . O 

. 9727 

CUM.  TOTfiL 

, 6 7 6 5 

6.9222 

C . D . F . 

5.4574 

PROBIT 

19 

1 

10 

106.0 

16.0 

61.0 

335196.0 

8545.0 

248784.0 

. 9886 

. 0252 

-7  . '-7 

8 I •«.*  1 

7.2759 

3 . 0 4 3 0 

5.62  3 7 

20 

1 1 

21.0 

6 6 . 8 

24432. 0 

264744.0 

.0721 

. 7808 

L 0 G H O R H H L 

3 . 5 3 9 1 

5, 7746 

P = R X + B 

12 

R 

26.0  , 

71.8  . 

1 . 9586 

4 9 667. 0 

278218.0 

B 

i 4 f c-; 

O 9 ft  r", 

c-  u 4 i u I 

3.-9433 

5.9173 

MEfiH--PROB I T=5 

•J  a «’’  b ( » 

4 

13 

MEDIAN 

3 1 , 0 

7 6 . 0 

44, 1525 

C;  1 9 1 c;  ft 

7 fi  9 9 5 6 , 6 

ST.  BE" . 

. 2416 

. 8551 

.5106 

4 . 2 9 9 1 

6 . 0 5 8 7 

MODE 

l 

34.0211 

5 

14 

EXP.  VALUE 

3 6.0  i 

8 1 . 0 

50.2  9 9 0 

116328.0 

3 0 8 0 3 2 , 0 

. 3431 

. 8849 

NO.  OBS.  USED 

4.5964 

6. 1997 

1 c, 

6 

15 

41.0 

8 6 . 8 

149879. 0 

3 0 9 0 2 4 . 0 

NORMAL 

, 4420 

.911  4 

4.8545 

6 . 3 4 9 5 

P = fl  X + B 

R 

—i 

r 

J.  tt 

4 6 . 0 

9 1 » 0 

B 

180259. 0 

3 16741. 0 

£ . 95 3 8 

. 5316 

. 9341 

MEAN- -PROBIT =5 

5.0791 

6 , 5 0 7 6 

58. 1447 

ST.  BE", 

1 "? 

24 . 5059 

S 1 ft 

'’I-,  ii 

209747. 0 

3 2 2 f . 0 

fir , 0!:  IJSEB 

. ft  2 


I 1 

1 


i 


APPENDIX  IV  Fortran  IV  listing  for  PDP-9T  computer 


i 

i 


j 


t 


C FILL  NOkMl)  F(jR  CALCULATING  THF  PARAMETERS  OF  A NORMAL 
C OR  lognormal  distribution 
C 

LOGICAL  found 

DIMENSION  fILE<2>*TITL(1A>#X(100>*Y<100)*XZ(100),PR(100) 
DIMENSION  pT I 100 ), DPR! 1P0)*DPT ( 100  > 

DATA  FjlE<2>/AH  SRC/ 

DATA  C0* Cl , CP/2.51 551 7# 0. 802853/0 *010328/ 

DATA  Dl* D2,D3/1 .432788.0* 189269/0*001308/ 

DATA  Al* a2>A3*AP/* 4361 836/ -*1201676/ *937258*  *3326  7/ 

C 


300 

C 

1 

301 

302 


303 

C 

2 

33A 

305 

C 


101 

201 


202 

3 

20  3 
A 

1 02 


2J4 


205 


WRITE  (b/300) 

FORMAT(50H  *IF  USING  LINE  PRINTER  • TYPE  1 FOR  Y£S/  0 FOR  NO) 
RE AD ( hi  305  ) LP 

*R I TE ( 6* 301  » 

F ORMAT ( ///2lH  *TYPE  DATA  FILE  NAME/) 

READ  I A/ 302 ) F ILF  < 1 ) 

FORMAT ( As ) 

CALL  FsTaT(5/FIlE*F0UND) 

IF  (FOUND)  GO  TO  2 
*RITE<6*303) 

FORMAT! 1SH  FILE  NOT  FOUND) 

GO  TO  1 

wRITE<b*30A> 

FORMAT (/36h  *TYp£  1 FOR  NORMAL*  0 FOR  LOGNORMAL) 

READ  < hi  305 | K 
FORMAT | II ) 

CALL  SEEK ( 5*  F ILE  ) 

READ ( 5*  1 01  ) TITl 
FORMAT( 1 A AS  I 
wRITE!  7*201 )TITl 
FORMAT! lHl / 1 4A5 ) 

IF  (K.eO*0)  GO  TO  3 
wR I TE ( 7*  202 > 

FORMAT ( /29h  NORMAL  DISTRIBUTION  ANALYSIS) 

GO  TO  a 
WRITE! 7*203) 

FORMAT ( /32h  LOGNORMAL  DISTRIBUTION  ANALYSIS) 

READ !5>l02)  XN 
FORMAT(Fi0,2) 

READ ( Si  102 ) TOT 
NN»NF»N*IFlX( XN) 

NS«1 

WR I TE ( 7*  20 A ) N#  TOT 

FORMAT ( /20H  NO*  OF  OBSERVAT I ONS* 5X* 1 3// 

1 1 2H  TOTAL  COUnT>5X#F10*2/) 

IF  (K*eQ*0)  GO  TO  11 
WRITE! 7*205 ) 

FORMAT! 11H  P » AX  ♦ B) 

GO  TO  1 2 
WRITE ( 7*206 ) 

FORMAT ( 15H  P • ALN(X)  ♦ B) 


I 


I 


11 

206 


c 

C READ  X AM)  Y AND  CALCULATE  PROBITS 
C 

1.-J  oo  a i i-ijn 

RF  AD ( 5* 1 /2  ) X ( 1 ) 

RFAD(5/102)  Y(I) 

IF  ( ( TOT-Y ( I ) ) -LT  • • 001  > GO  TO  38 

PR<  I )«Y  < I ) /TOT 

If  ( PR ( I ) .LE« .5  ) Q*PR ( I ) 

If  (PR( I ) ,qT.  .5)  &■  1 • “PR ( I I 
T»SQRTi Al03( 1 • / ( U*Q I ) ) 

PTEMP-T-lC0*T*(rl^T*C2>)/( 1 . *T * ( D1 ♦ T* ( D2  + C 3* T > » ) 

IF  (P«(  I ) .lE*  .5)  PT ( I I-5.-PTEMP 
IF  (PR( I ) .qT.  .5  ) PT ( I (-5.+PTEMP 
GO  TO  3^ 

33  NN-NF»n-1 

39  If  (K.E'Q»0)  GO  TO  40 

XZ(  I )-X<  I 1 
GO  TO  U 

*4  XZ  I I )»ALOG(X(  I ) 1 

41  CONTINUE 

c 

C CALCULATE  sums 

c 

43  SNX»SMx2«SMP-SMP2«SMPX-0. 

NUSE  »0 

DO  45  I “NS*  NF 
NUSE-NuSE ♦ 1 
SMX-SMx+XZ ( I > 

SMX2«SmX?*xZ( I )*XZ<  I ) 

SMP-SMP  + PT ( I ) 

SMP2-SmP2-*-PT  ( I ) *PT  ( I ) 

*5  SMPX«SMpX+pT  ( I I *XZ  ( I ) 

c 

C CALCULATE  PARAMETERS  OF  distribution 

c 

tn-floaT inuse  ) 

*RITE( 7 * 20  7 INUSE 

237  FORMAT ( ////38H  nO»  OF  OBSERVATIONS  USED  IN  ANALYSIS  >13//) 
A ■ ( TN*sNPX-SMP*SMX )/(TN*SMX2-SMX*SMX) 

B- ( SMP-A*SmX ) /Tn 

^■(  TN*sMx2-SMX*SITIK  I * < TN*SMP2-SMP*SMP  I 
R-( TN*S^PX-SMX*SMP  ) /SORT ( W ) 
wRITE (7* 208  I AxR 

238  FORMAT ( 5h  a » #F10*4/5X#5H  B • #F10*4/I 
XM»( 5»-B  )/A 

S®1 »/A 

wRITE(7'209>XM*S 

209  FORMAT ( 7h  mEAN  F 10 • 4/ 

l 2iH  standard  deviation  -#fi0.4/i 
IF  (K.eQ»1 ) GO  TO  59 
XMODE«E*P<  XM-S*S I 
XMED-ExP«XM» 

XMD«EXP(XM>»5*S*SI 
WRITE( 7*210 )XMOnE/XMED#XMO 

210  FORMAT ( 20H  MODE  ■ EXP(M-S*S)  • *F10*4/ 

1 18H  MEDIAN  ■ fXP(MI  » ;F10*4/ 


m 


c 37m  f.;  / p t.  C T “ I l 'I N VAlUE  = E X P t M + ,)  • 5 S * 3 ) = * F 1 1 • 4 ! 

S)  l'O  M I =\S  i NF 

61  CPT ( I > = < A*xZ ( I ) +3  > “PT  < I > 

*«ITE(6»3U  ) 

311  F.3«MaT(/3Pm  DIFFERENCE  < aZ  + 3)  - PROFIT) 

»XITE<6'318>  < X ( I ) / DP  T ( I )/I=NS/NF  ) 

312  FOPRATiSlFz^lfFZ'S)  ) 

»RITE(6'3l8>  « 

318  F0RMAT(27H  CORRELATION  COEFFICIENT  = jF8.VI 

Ir(LP*E.Q«il)  GO  TO  62 
wRITEl 7 * 3 1 1 ) 

wRITE(70l2>  < X ( I ) / DPT  ( I )/I=nS/NF> 

WRIT£(701«)  R 
C 

c option  for  omitting  observations 
c 

62  (*RITE<6'313> 

313  F0RMAT(//46H  *0«IT  OBSERVATIONS  - TYPE  1 FOR  YES/  0 FOR  NO) 
READ  ( 4/  305  ) M 

IF ( M • EQ • 0 ) GO  TO  63 
WRITE(6015> 

315  FORMAT ( 19M  *HOw  MANY  AT  START) 

R£AD!4/3l6)  NS 

N 5 = N S + l 

316  FORMAT ( 12) 

* R I T E ( fa  * 3 1 7 1 

317  FORMAT ( 17H  *HOR  MANY  AT  END) 

READ(<f/3l6)  NE 

N F = N N " N E 
60  TO  t*  3 
C 

c CALCULATE  PROBABILITIES  FROM  MEAN  AND  STANDARD  DEVIATION 
C 

63  DO  64  I=l/N 

Z= ( XZ ( I )-Xm)/S 
T = 1 • / ( 1 • + A R * Z ) 

IF(Z»LT*0» ) T = 1 ./ ( 1 *-AP*Z ) 

F=EXP( -Z*Z/2» ) /SORT <2. *3 *141 59) 

P=1.-F«T*(a1+T*(A2+A3*T) ) 

IF(Z*LT*0«)  P*1«"P 

64  DP«( T )*P-PR( I ) 

C 

c print  out  results 
c 

yRITE(7»2U  ) 

211  FORMAT ( ///&X/ 1HX/ 12X/ 1HY/ 8X/8MX  OR  LNX / 3X / 1 1 HPROB AB I L I T Y / 

1 4X/6hpR0bIT/5X>5HDPR0R/ ) 

DO  65  I 3 1 » N 

65  WRITE(7*2l2>X( I ) .»  Y ( I ) » XZ I I )/PR( I I »PT ( I )/ DPR< I ) 

212  FORMAT! F10.H#3X,F10»1/3X*F10«4/2XF10»5>2X*F1 3 »4/2X/F10*5) 

60  CALL  CL^SEtS) 

GO  TO  1 
END 


APPENDIX  V Input  data  file  for  Fortran  program 


PLATuLET  volume  distribution  FOR  RAT  NO*  1 POST  OIVE 

1 9 • 

3 .1 9 3 7 5 • 

16  • 

8565  • 

2 1 • 

26632* 

26  • 

63667 • 

3 1 • 

81915. 

36  • 

116328. 

61  . 

169379. 

66  » 

180259. 

5 1 * 

2 3 6 7 4 7 . 

56.  ...  . ..... 

2293/5. 

61. 

268736. 

6 6 « 

266766. 

71. 

2 7 8 d 1 8 • 

76. 

289956. 

8 l • 

333312. 

86* 

3 19326. 

91. 

3 1 6 7 > 1 • 

96. 

323697. 

131. 

329312. 

136. 

3951 96 . 


APPENDIX  VI  Lognormal  distribution  analysis  for  platelet  sample  - 
teletype  output. 


MONITOR  T4G 

SA  DKI  5/TT  At  6* 7 

SGLOAD 


L0AD5  T9 A 
>-NORMD 

♦IF  USING  LINE  PRINTER  - TYPE  I FOR  YES  0 FOR  NO 
0 


♦TYPE  DATA  FILE  NAME 
RATI 

♦TYPE  1 FOR  NORMAL*  0 FOR  LOGNORMAL 
0 

PLATELET  VOLUME  DISTRIBUTION  FOR  RAT  NO.  1 POST  DIVE 
LOGNORMAL  DISTRIBUTION  ANALYSIS 
NO.  OF  OBSERVATIONS  19 

TOTAL  COUNT  339075.00 

P * ALN(X)  ♦ B 


NO.  OF  OBSERVATIONS  USED  IN  ANALYSIS  19 


A * 2.0878  B = -2.8742 

MEAN  = 3.7714 

STANDARD  DEVIATION  » 0.4790 

MODE  * EXP(M-S^S)  « 34.5373 

MEDIAN  « EXP(M)  = 43.4428 

EXPECTATION  VALUE  * LXP( M* 0 . 5S^ S ) = 48.7229 


DIFFERENCE  ( AZ  ♦ B)  - PROBIT 


16.0 

-0.128 

21.0 

-0.057 

26.0 

-0.020 

3 1.0 

-0.004 

36.0 

0.011 

41.0 

0.025 

46.0 

0.040 

51.0 

0.057 

56.0 

0.073 

6 1.0 

0.08  5 

66.0 

0.099 

7 1.0 

0.108 

76.0 

0.109 

8 1.0 

0.101 

8 6 . f 1 

0.076 

9 1.0 

0.036 

96.0 

-0.037 

101.0 

-0.161 

106.0 

-0.414 

CORRELATION  COEFFICIENT  * 0.9941 


♦HOW  MANY  AT  START  ! 

0 

♦HOW  MANY  AT  END 
4 


NO.  OF  ObSERV AT  I ONS  ubED  IN  ANALYSIS  15 


A * I .9586  b = -2.4187 

MEAN  * 3.7877 

STANDARD  DEVIATION  = 0.5106 

MODE  = EXF(M-S^S)  = 34.0211 

MEDIAN  = EXP(M)  = 44.1525 

EXPECTATION  VALUE  = EXP  < M+- 0 . 5S^S ) * 50.2990 

DIFFERENCE  < AZ  ♦ B>  - FRObIT 


16.0 

-0.031 

21.0 

0.005 

26.0 

0.014 

31.0 

0.006 

36.0 

0.004 

41.0 

0.000 

46.0 

0.001 

51.0 

0.004 

56.0 

0.008 

61.0 

0.009 

J 

66.0 

0.013 

7 1.0 

0.013 

76.0 

0.005 

8 1.0 

-0.011 

86.0 

-0.044 

/ 

CORRELATION  COEFFICIENT  = 0.9999 


♦OMIT  OBSERVATIONS  - TYPE  I FOR  YES,  0 FOR  NO 
0 


X 

Y 

X OR  LNX 

PROBABILITY 

PROBI T 

DPROB 

16.00 

8545.0 

2.7726 

0.02520 

3.0430 

-0.001 79 

21.00 

24432.0 

3.0445 

0.07205 

3.5391 

0.00070 

26.00 

49667.0 

3.258 1 

0. 14648 

3.9483 

0.00333 

31 .00 

81915.0 

3.4340 

0.24158 

4.299  1 

0.00267 

36.00 

1 16326.0 

3.58  35 

0.34307 

4.5964 

0.001 58 

4 1.00 

149879 .0 

3.7136 

0.44202 

4.8545 

0.00029 

46.00 

180259 .0 

3.8286 

0.53162 

5.079  1 

0.00039 

51.00 

206747.0 

3.9318 

0.60974 

5.2782 

0.00144 

56.00 

229375.0 

4.0254 

0.67647 

5.4574 

0.00276 

61.00 

248  78  4.0 

4.1109 

0.7337  1 

5.6237 

0.00294 

66.00 

264744.0 

4.1897 

0.78  078 

5.7746 

0.00369 

7 1.00 

278218 .0 

4.2627 

0.82052 

5.9173 

0.00340 

76.00 

289956.0 

4.3307 

0.85514 

6.0587 

0.001*4 

8 1.00 

300032.0 

4.3944 

0.88485 

6.1997 

-0.002 ) 6 

8 6.00 

309024.0 

4.4543 

0.9 1 1 37 

6.3495 

-0.00^ • 7 

9 1.00 

316741 .0 

4.5109 

0.93413 

6.5076 

-0.01244 

9 6.00 

323697.0 

4.5643 

0.95465 

6.6921 

-0.0:874 

101.00 

329812.0 

4.6151 

0.97268 

6.9222 

-0.02522 

106.00 

335196.0 

4.6634 

0.98856 

7.2759 

-9. 03 » 
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APPENDIX  VII 


Normal  distribution  analysis  for  platelet  sample  - 
line  printer  version. 


a)  Teletype  Printout 
TS  Z2b- A 


MONITOR  TAG 

$ A DKI  5/TT  4,6/LP  7 

SGLOAD 

LOADS  T9A 
»*NORMD 

♦IF  USING  LINE  PRINTER  - TYPE  I FOR  YES,  0 FOR  NO 
1 


♦TYPE  DATA  FILE  NAME 
RATI 


♦TYPE  I FOR  NORMAL,  0 FOR  LOGNORMAL 
1 


DIFFERENCE  ( AZ 
16.0  0.S6A 

♦ B) 
21.0 

- PROB 
0.272 

IT 

26.0 

0.066 

3 1.0 

-0.08  0 

36.0 

-0.174 

41 .0  -0.228 

46.0 

-0.248 

51.0 

-0.243 

56.0 

-0.218 

6 1.0 

-0.181 

66.0  -0.128 

71  .0 

-0.066 

76.0 

-0.004 

8 1.0 

0.059 

86.0 

0.114 

91.0  0.160 

96.0 

0.179 

101.0 

0.153 

106.0 

0.003 

CORRELATION  COEFFICIENT  * 

0 

.9835 

♦OMIT  OBSERVATIONS  - TYPE  I FOR  YES,  0 FOR  NO 
0 


♦TYPE  DATA  FILE  NAME 
♦ C 

TS  Z2B-A 


MONITOR  TAG 
S 


-Ana  Printer  Printout 


PLATELET  VOLUME  DISTRIBUTION  FOR  PAT  no*  1 POST  DIV<- 
NORMAL  DISTRIBUTION  ANALYSIS 
no*  of  odsfrv at  ions  is 

total  count  339075*w<;' 

P » AX  ♦ H 


Nr).  iJF  OBSERVATIONS  uSFO  IN  ANALYSIS  19 


A a 0*0408  B - 2*9538 

MFAN  - 50.1447 

STANOARD  DEVIATION  ■ 24.5059 


difference  iaz 

16*0  0*564 

+ P)  - PROBIT 
2 1 • 0 0*272 

26*0 

0*066 

31*0  -0*080 

36*0  -0. 

174 

41*0  *0 • 

46.0  —0*248 

51 .0 

-0.243 

56*0  -0*218 

61*0  -0* 

181 

66*0  -0*128 

71.0  -0.066 

76*0 

-0.004 

81*0 

0*059 

86*0  0* 

114 

91*0  0*160 

96.0  0.179 

101  .0 

0.153 

106*0 

0*003 

CORRELATION  COEFFICIENT  - 0*9835 


X 

Y 

X OR  LNX 

PROBABILITY 

PROBIT 

DPROB 

16*00 

8545  * 0 

16*0000 

0*02520 

3*0*30 

0*05655 

21.00 

24*32.0 

21*0000 

0*07205 

3*5391 

0*04510 

26.00 

49667.0 

26*0000 

0*  14648 

3*9*83 

0*01576 

31.00 

81915.0 

31 *0000 

0*2*158 

4 • 2991 

-0*02425 

36.00 

116328.0 

36*0000 

0*3*307 

4*5964 

-0 • 06  life 

41.00 

149879.0 

41 *0000 

0 • **202 

4*8545 

-0*08750 

46*00 

180259.0 

46*0000 

0*53162 

5*0/91 

-0*09878 

51.00 

206747. 0 

51*0000 

0*6097* 

5*2782 

-0*09581 

56.00 

229375.0 

56*0000 

/ • 676*7 

5*4574 

-0*08205 

61.00 

2 4 8 7 8 4 . 0 

61 *0000 

0*  73371 

5*6237 

-0*06262 

66.00 

264  7 H 4 . 0 

66  • 0000 

0*78078 

5*7746 

-0*03961 

71.00 

27h2 18.0 

71 • 1000 

0*82052 

5*9173 

-0*01790 

76.00 

289956.0 

76 • 0000 

0*8551* 

6*0587 

-0*00083 

81  .Y  0 

300032.0 

81 . 0000 

0*88*85 

6*1997 

0*01116 

86.00 

309024.0 

86 • 0000 

0*9113/ 

6*3495 

0*01692 

31 .00 

31*741 .0 

91  * 0000 

0*93413 

6*5/76 

0*01813 

96.00 

323697.0 

36*0000 

0 • 95465 

6*6921 

0*01469 

im  .00 

329*12.0 

101 • 000/ 

0 • 9 7 2n  s 

6*9222 

0*00893 

l 06 .00 

335196.0 

1 06*  0000 

0 • 98856 

/ • 2 759 

0*00010 

